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Abstract 

The c-Balanced Separator problem is a graph-partitioning problem in which given a 
graph G, one aims to find a cut of minimum size such that both the sides of the cut have at least 
cn vertices. In this paper, we present new directions of progress in the c-Balanced Separator 
problem. More specifically, we propose a new family of mathematical programs, which depends 
upon a parameter e > 0, and extend the seminal work of Arora-Rao-Vazirani (ARV) [3] to show 
that the polynomial time solvability of the proposed family of programs implies an improvement 
in the approximation factor to O ^log 3+e n^ from the best-known factor of 0(\/log n) due to 

ARV. In fact, for e = 1/3, the program we get is the SDP proposed by ARV. For e < 1/3, 
this family of programs is not convex but one can transform them into so called concave 
programs in which one optimizes a concave function over a convex feasible set. The properties 
of concave programs allows one to apply techniques due to Hoffman [10] or Tuy et al [17] to solve 
such problems with arbitrary accuracy. But the problem of finding of a method to solve these 
programs that converges in polynomial time still remains open. Our result, although conditional, 
introduces a new family of programs which is more powerful than semi-definite programming in 
the context of approximation algorithms and hence it will of interest to investigate this family 
both in the direction of designing efficient algorithms and proving hardness results. 



1 Introduction 



Graph partitioning is a problem of fundamental importance both in practice and theory. Many 
problems belonging to the several areas of computer science namely clustering, PRAM emulation, 
VLSI layout, packet routing in networks can be modeled as partitioning a graph into two or more 
parts ensuring that the number of edges in the cut is "small" . The word "small" doesn't refer to 
finding the min-cut in the graph as it doesn't ensure that the number of vertices in both sides of the 
cut is large. To enforce this balance condition one needs to normalize the cut-size in some sense. 
For the known notions of normalization like conductance, expansion and sparsity, finding optimal 
separators is NP-hard for general graphs. Hence, the objective is to look for efficient approximation 
algorithms. 

Because of the huge amount of work done to design good approximation algorithm for these 
problems, graph partitioning has become one of the central objects of study in the theory of geomet- 
ric embeddings and random walks. The first approximation algorithm for Graph Conductance 
came out of the study of the Reimannian Manifolds in form of the well known Cheegar's Inequality 
[6] which says that if $(G) is the conductance of the graph and A is the second largest eigen- 
value of graph Laplacian then 2<I>(G) > A > &(G) 2 /2. Because of the quadratic factor in the 
lower bound, the true approximation is <j>^3y which in worst case can be il(n) in worst case. The 
first true approximation algorithm for Sparsest Cut and Graph Conductance was designed 
by Leighton and Rao [H] whose approximation factor was O(logra). This also gave an O(logra) 
pseudo- approximation algorithm for c-Balanced Separator. This algorithm is referred to 
as a pseudo-approximation algorithm because instead of returning a c-balanced cut, it returns a d- 
balanced cut for some fixed d < c whose expansion is at most 0(log n) times the optimum expansion 
of best c-balanced cut. Their algorithm was based on an LP framework motivated from the idea of 
Multi-commodity flows. Their main contribution was to derive an approximate max-flow min-cut 
theorem corresponding to multi-commodity flow problem and the sparsest cut. Subsequently, a 
number of results were discovered which showed that good approximation algorithms exist when 
one is considering extreme cases such as the number of edges in the graphs is either very small or 
very large. In fact, it is known for planar graphs one can find balanced cuts which are twice as op- 
timal [8j and for graph with an average degree of Q(n), one can design (1 + e)-factor approximation 
algorithms where e > with running time polynomial in input size [2] (such an algorithm is called 
a Polynomial Time Approximation Scheme or PTAS). After 16 years the approximation factor of 
O(logn) was improved to O(ylogn) in a breakthrough paper by Arora, Rao and Vazirani. Their 
algorithm is based on semi-definite relaxations of these problems. The techniques and geometric 
structure theorems proved in their paper has subsequently led to breakthroughs in the field of 
metric embeddings. The basic philosophy behind these approximation algorithms is to embed the 
vertices of the input graph in an abstract space and derive a nice cut in this space. In the linear 
programming approach one uses this abstract space as the l\ metric |14 |, I15 |. [21] . In the semi-definite 
programming framework used in [1] one embeds the vertices on the surface of an n-dimensional 
unit sphere such that they form an l\ metric. The l\ metric on the unit sphere translates into 
saying that for any three vectors the angle subtended by any two among these at the third one is 
acute. One of the major tools used in this paper is the phenomenon of measure concentration on 
unit spheres. 

Hardness Results: Graph partitioning problems like Sparsest Cut and Balanced Sepa- 



RAT OR are considered to among the few NP-hard problems which have resisted various attempts 
to prove inapproximability results. After the result of ARV, there has been a lot of impetus to- 
wards proving lower bounds on approximation factors. It has been shown by Ambuhl et al pQ 
that Sparsest Cut can't have a PTAS unless NP-complete problems can be solved in random- 
ized sub-exponential time. Because of the strong connections between semi-definite programming 
and the Unique Games Conjecture (UGC) of Khot [12J, certain inapproximability results are 
also known which assume UGC. More specifically, Khot and Vishnoi [13] show that UGC implies 
super-constant lower bounds on the approximation factor. In the following year, Devanur et al [7] 
showed that the integrality gap of the SDP relaxation of Arora-Rao-Vazirani is Q (log log n) thereby 
disproving the original conjecture of ARV that the integrality gap of their SDP relaxation is atmost 
a constant. This result did not rely upon the UGC. The recent progress towards designing efficient 
and good solutions to Unique Games has also been motivated from designing a reduction from 
Unique Games to Sparsest Cut [3]. 



1.1 Concave Programming 

In order to define Concave Programming one first needs to define a concave function. A concave 
function is the reverse of a convex function. Formally, a function / : R^ — > R with domain dom/ is 
said to be concave if dom/ is convex and for all x, y G dom/, f(Xx + (l — X)y) > A/(x) + (1 — A)/(y) 
for all A E [0 — 1]. Therefore, / is concave iff — / is a convex function. Based on this definition 
one defines concave programming as a form of mathematical programming in which one optimizes 
a concave function over a convex feasible set. More formally, a concave programming problem can 

where C is a convex set in R rf and / is a concave function. 



be written as 



min/(x) 



Concave programming covers a broad range of non-linear global optimization problems which 
includes the well-known DC(Difference of Convex Functions) programming . Due to its well- 
structured nature and wide applicability in economic problems as well as various other practical 
problems like allocation-location, water storage, standardization etc. [T7], there has been a lot of 
work in the field of optimization towards designing algorithms for various concave programming 
problems. One of the key properties of concave programming being exploited in these algorithms is a 
result that says that for every concave programming problem there is an extreme point of the convex 
feasible set C which globally minimizes the optimization problem. The first algorithm for concave 
programming was designed by Tuy [TH] in a restricted scenario when the feasible set is a polytope. 
A more general case, when the feasible set is convex but not necessarily polyhedral, was solved by 
Horst [TT] and subsequently by Hoffman [10], Tuy and Thai |19| . General concave programming is 
NP-hard as {0, l}-integer programming can be cast as a concave program. There has been work 
towards designing efficient algorithms for some special class of concave programming. It has also 
been shown that some concave programs problems pertaining to Production- Transportation Prob- 
lems can infact be solved in strongly polynomial time [20]. A comprehensive list of works done in 
concave programming can be found in Vaserstein's homepage [22]. 

In this work, we introduce the use of a new family of concave programs towards designing an 
improved approximation algorithm for the c-BALANCED SEPARATOR problem. 



1.2 Contributions and Outline 



In section 2, we formally introduce the notions of sparsity and balanced cuts and sketch the Semi- 
Definite relaxation for c-Balanced Separator of ARV. We then start section 3 by introducing 
a family of relaxations for c-Balanced Separator which is generated by a parameter p > 0. In 
section 4, using the techniques from [3], we show that one can improve the approximation factor 
to O ^log( 1+ 2)/ 3 nj if the proposed family of programs can be solved (by solving we mean getting 

a (1 + e)-approximate answer) in polynomial time. Our result, although conditional, proposes 
new directions of progress on this problem and also a family of optimization problems which are 
more powerful than semi-definite programs in the context of approximation algorithms. Then in 
Section 5 show that one can transform this family of programs into a concave program, a form 
of mathematical programming in which one seeks to minimize a concave function over a convex 
feasible set. There are a number of algorithms which can solve such programs with arbitrary 
accuracy |1CH [TT] , although one is not guaranteed to achieve a polynomial time convergence using 
these algorithms. Since this family is a new form of mathematical programming that is being 
used in an approximation algorithm, progress both in the direction of hardness and algorithms will 
provide more insights into the nature of these concave programs and can potentially lead us to 
optimal inapproximability results for various graph-partitioning problems. We end the paper with 
Section 6 in which we present conclusions and open problems. 



2 Preliminaries 

We now define the two versions of balanced graph partitioning problem namely the Sparsest 
Cut and c-Balanced Separator. It is well known that upto constant factors approximating 
other versions of graph partitioning like like Graph Conductance and Uniform Sparsest Cut 
are equivalent to approximating the Sparsest Cut. Although in this paper, we will mainly be 
concerned with the c-Balanced Separator problem. 
Sparsest Cut 

Given a graph G = (V,E) with \ V\ = n, \E\ = m, for each cut (S,S) define sparsity of the cut to 
be the quantity A(S) = jgi ■ The sparsest cut problem is to find a(G) where 
a(G) = min A(S). 

ScV,\S\<n/2 

c-Balanced Separator 
Given a graph G = (V,E) with \V\ = n, \E\ = m, the c-Balanced Separator problem is to find 
aJG) where a c (G) = min E(S,S). 

ScV,cn<\S\<(l-c)n 



2.1 SDP Relaxation for c-Balanced Separator 

Unifying the spectral and the metric based (linear programming) approaches, ARV used the fol- 
lowing SDP relaxation to get an improved (pseudo)-approximation algorithm for the c-Balanced 
Separator. Let us call this program SDPbs, 



1 In [4] c-BALANCED Separator is defined as the minimum sparsity of c-baianced cuts, we wili be working with 
a definition which upto constant factors is equivalent to their definition 
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i,jeE 
\\vi\\ 2 = 1 Vi 
IK - Vj\\ 2 + - v k \\ 2 > \\vi - v k \\ 2 Vi,j,k 
- VjH 2 > 4c(l - c)n 2 

It is easy to see that this indeed is a vector program (and hence an SDP) and is a relaxation 
for the c-Balanced Separator problem. To show that this is a relaxation we have to show that 
for every cut we can get an assignment of vectors such that all the constraints are satisfied and the 
value of the objective function is the size of the cut. Given a cut (S, S) if one maps all the vertices 
in S to a unit vector n and the vertices in S to -n then the value of the function is indeed the 
cardinality of E(S, S). The main idea behind their algorithm is to show that for any set of vectors 
which satisfy the constraints of the SDP there always exist two disjoint subsets of "large" size such 
that for any two points belonging to different subsets the squared Euclidean distance between them 

is atleast £1 ( ^/\l gn ) ' same idea * s a ^ so use d to get an improved approximation algorithm for 
Sparsest Cut in [2]. Subsequently, this key idea has crucially been used in various other SDP 
based approximation algorithms and in solving problems related to metric embeddings. 



3 A New Relaxation for c-Balanced Separator 

Consider the following family of optimization problems which depend on a parameter p > 0. This 
family is essentially an extension of the semi-definite program proposed by ARV. Throughout the 
paper we will use |.| to represent the 1% norm. Let us call this family of programs F^g- 

min — \\ v i ~ v j\\ P 
i,jeE 
\\vi\\ 2 = 1 Vi 
IK - Vj\\ p + \\vj - v k \\ p > \\vi - v k \\ p Vi,j,k 

y~] \\vi - Vj\\ 2 > 4c(l - c)n 2 
i,jt=E 

Note that for p = 2 this is the SDP relaxation proposed by ARV. For p = 1, we are mapping 
the points onto a unit sphere, therefore we do not have to force the additional triangle inequality 
constraint of I2 metric. The same mapping described for SDPbs of the vertices of the graph onto 
the unit sphere allows us to conclude that each program in this family is also a relaxation for 
c-Balanced Separator. We will show that the techniques used in [3] for lower bounding the 
optimum value of their semi-definite program can be extended in this case as well by appropriately 
modifying the ingredients of Theorem 1 of their paper. Under the assumption that we can solve 
Fg S in polynomial time for any p, < p < 2, we are able achieve an approximation factor of 

O ^log3 + f n^j. We first show how to modify the results of [4], thereby reducing the problem of 
obtaining an improved approximation algorithm to that of finding a polynomial time algorithm for 
solving the family of programs mentioned above. 



4 ARV Proof Modifications 



We first modify the definition of A-separated sets 

Definition 1 (A-Separated Sets). Two sets of vectors in M. d , S and T are said to be A-separated 
if for all v £ S and u £ T \\v — u\\ p > A. 

Definition 2. For p > 0, a set of vectors in M. d is said to be a unit c-spread I?, representation if 
they satisfy the last three constraints in the program F^ s . 

Under the new definition of A-separated sets the main theorem of ARV can be modified in the 
following way: 

Theorem 1. For every d > 0, there are constants c, b > such that every c-spread unit-l?, rep- 
resentation with n points contains A-separated subsets S,T of size c'n, where A = felog - '- 1 ^)/ 3 n. 
Also, there is a randomized polynomial-time algorithm for finding these subsets S, T. 

This theorem immediately allows us to conclude the following result. 

Theorem 2. Given a graph G = (V,E), if the program F^ s can be solved in polynomial time 
for a fixed p, then there exists a randomized O (log( 1+ z^ 3 -pseudo approximation algorithm for 
c-Balanced Separator. 

Proof. (Sketch) For a fixed p, let U = {u\,U2, ■ ■ ■ ,u n } is the optimum solution to the program 
Fgg with the optimum value value as O. We construct the weighted graph G' on the vertex 
set of the original graph, s.t. for every edge (vi,Vj) we impose a weight of ^p\\vi — Vj\\ p . Now 
apply the algorithm of Theorem [3] to obtain two subsets S' ,T' of U of size at least c'n which are 
A = 61og~( 1+ 2)/ 3 n-separated. Let S, T be the corresponding sets of vertices in V. Pick a number r 
randomly uniformly from the range [0-A] and report the cut (V r , V r ) as the answer where V r is the 
set of all vertices within distance r from S. Performing a similar analysis as in Corollary 2 of [3], 
it is easy to show that with high probability (V r ,V r ) is a 0(log( 1+ 2)/ 3 n)-approximate c'-balanced 
cut. □ 

4.1 Proof of Theorem Q] 

Given a unit c-spread I?, representation of vectors, the algorithm to find two A = &log~( 1+ 2)/ 3 n- 
separated sets needs a small modification over the Set-Find algorithm of ARV which is presented 
as MODIFIED-Set-Find. Notice that the modification is made at the last step when the algorithm 
is discarding pairs. 

In order to prove our claim for MODIFIED-Set-Find we will borrow the definitions of (a, 5, c')- 
matching cover, (a, <5)-uniform matching cover and (e, <5)-cover directly from [3]. Among these we 
will only reproduce the definitions of (a, 5, c')-matching cover and (e, <5)-cover. The basic idea is 
that the all these notions of covers do not depend on the triangle inequality of l\ metric and hence 
they also make sense for l p representations. 

Definition 3. For a set V of vectors, a (a, 5, c')-matching cover is a set of (partial) matchings A4 
such that for at least 5 fraction of directions u there exists a matching M u £ M with size at least 
c'n such that for each pair (vi,Vj) in the matching (u, — vj,u) > 1aj\fd. 



Input: A set of vectors V = {vi,V2, ■ ■ ■ v n } in M. d which form a unit c-spread 1% 

representation and parameters A and a. 
Output: Two sets S and T which are A-separated with the desired balance d . 

1. Pick a random unit vector u. 

2. Let ffc be the vector that realizes the median of the values taken by (vi,u) for 
i = 1, 2 ... n and let m be the median value. 

3. Let S' be the set of vectors in V satisfying (v{,u) > m + ^= and T' be the 
vectors in V which satisfy (vi,u) < m — ^= . 

4. If \S'\ < 2c'n or \T'\ < 2c'n, HALT. Otherwise remove all the vectors E 5' 
and Uj 2 E T' which satisfy — Vi 2 || p < A. 

Return: Remaining sets as S and T. 

Algorithm 1: MODIFIED-Set-Find 

Let M be the multi-graph obtained by the union of all the matchings in A4. 

Definition 4. A set of vectors w\,W2, ■ ■ ■ ,w n is is said to be an (e,5)-cover if \\wi\\ < 1 for all i 
and for at least 5 fraction of the directions there exist i E [n] (u)i,u) > e. A set of vectors if said to 
(e, 5)-cover a point x, if the set of vectors {x — Wi\i E [n]} is a (e, 5)-cover. 

The most important thing to note is that the well-separated constraint (X^jjeB \\ v i ~ v j\\ 2 — 
4c(l — c)n 2 ) is the same for both SDPbs and Fg S . This allows us to conclude that Lemmas 
3-7 of [1] all hold for any set of unit c-spread l^ representations as well. Only Theorem 8 of [1] 
needs considerable changes which we present as Theorem [3j For the sake of completeness we will 
reproduce the necessary ingredients used in the proof of Theorem 8 of [1] namely the definition of 
fe-core and Lemma 7. But before going into the proof of our version of Theorem 8, let us recall the 
behavior of projection of a random unit vector onto a fixed vector. 

Lemma 1. If v is a vector of length I in M. d and u is a randomly chosen unit vector 

• forx< 1, Pr{| (v,u) | < < 3x. 

• forx< Pr{| (v,u) | > ^} < e~ x2 ' 4 . 

Definition 5. Given a set of n points about that is (o~,6,c') matching covered by M with associated 
matching graph M, define v to be in the k-core, Sk if v is (^^ ; 5)- covered by points which are 
within k hops of v in the matching graph. 

The following lemma (Lemma 7 of ARV) captures an important property of matching covers. 
A crucial result used in its proof is Levy's iso-perimetric inequality and measure concentration on 
spheres [TB] . 



Lemma 2. For every set ofn points that is (a, 5, c')-matching covered by A4 with associated match- 
ing graph M , there are positive constants a = a{5, c') and b = b(5, c') such that for every k > 1 one 
of the following holds: 

1. \S k \ > a k n. 

2. There is a pair with distance at most k in the matching graph M such that ||t>j — Vj\\ > 

The following lemma can be used to prove the main theorem which shows that the algorithm 
MODIFIED-Set-Find succeeds with constant probability. 

Theorem 3. The Modified -Set- Find algorithm finds a A-separated set for a c-spread K repre- 



p 



sentation with constant probability for A = ©(log' 3 n) where (5 = — g 2 -. 

Proof. If MODIFIED-Set-Find fails with probability 1 — 5, then according to the definition of 
matching covers it can be shown that, the set of deleted points will be (a, 5/2, c')-matching covered. 
Let M be the associated matching graph. This implies that we can use Lemma [2] for the set of 
points. We will show that in such a situation both the cases of Lemma [2] do not hold which in turn 
implies that MODIFIED-Set-Find does not fail with high probability. We first start with dispensing 
the case 2 of Lemma Since the points lie in a Zg metric and vi and Vj are within /c-hops in the 
corresponding matching graph M we will have — Vj\\ p < kA which implies \\vi — Vj\\ < tykA. 

Now let us choose k = where r = tttt and a = n Under this choice of k, one can verify 
A 1+ f (h+p 

that > -\/kA which will lead us to a contradiction. 

Now we consider the case 1 of Lemma[2j This says that the number of vectors which are \) 
covered by points within fc-hops of the matching graph M, is at least some constant fraction of 
the total number of points. Consider a point Vi which is in the /c-core as defined above. Let Vj 
be a point that belongs to the set that ^)-covers V{. Now by definition of A;-core with at least 

probability |, (vj — Vi,u) > But because of the fact that the points come from a metric 

with in k hops \\vi — Vj\\ < tfkA. Now using the Gaussian behavior of projections for a vector 

| X 1 1 Dj 

Vi — Vj a randomly chosen unit vector u satisfies, Pr < (vj — Vi,u) > 

Therefore taking x = oM ka — n we get, Pr < (v* —Vi.u) > ^ a ^ v 
*\\Vi VjU ^ 2 \\ Vi - 

b 2a 

If we denote the exponent of e by A, then we have A = 




2(1-0 

16A 2r • A^^ 

/ 2a\ 2 

we have the exponent of a as ( 2 + 2a = 2 — -r= + 2a = 2a + 

V PJ + P + 2 

7 = . Now one can choose a small constant g such that A = ~ — for (3 < 1 

16 log p n 

7 

and 2(i-r) > which can be done because for a fixed p 7 is a constant and we are going to set 

9 2r+ p 

2p(r I 1 — r ) 

2/3(r + = 1- We therefore get the desired probability to be atmost e~ 7 ^ ogn ^ P . Putting 



r = j^p; we get the exponent of (logn) as 2(3 ^ - — p -\ 1 +p/ 2 \ _ ^ _ Therefore, if 

1 + | , 

we choose (3 = , we can get this probability as atmost -j. Now, clearly the probability that a 

3 n 
vector Vi is covered by points within /c-hops of matching graph is atmost the probability that there 

exists two points V{ and Vj such that for a random unit vector u, the above event occurs. From the 

above calculation, the probability that such an event occurs for any pair Vi, vj is less that 0{-K) via 

the union bound contradicting the condition of Lemma [2] that this probability is at least 1/2. □ 

Some Discussion on the Result: 

It is not clear whether this method will receive benefits from the stronger version of Lemma [2] of [I] 
in which they prove that for the second case \\vi — Vj\\ > a and use it along with other ideas so that 

their algorithm works even for A = (^j=) • The main reason is that if we try to take a k that is 
of the form chosen in Theorem [3] then we don't get a dependence of r in terms of p and therefore we 
don't get a parametrization of the approximation factor in terms of p. Although one might come up 
with a method such that the above mentioned result can also be used to get an improvement over 
this bound of A. One can also ask the question, why did we choose to set (3 (143) as 1 because 
we could have improved the bound on probability if we had chosen a value greater than 1 but in 
that case one can easily notice that we would have to sacrifice with the approximation factor and 
we would have got value a value of (3 which is worse than this value. 



5 A Concave Programming Formulation 

In this section, we consider the family of optimization problems Fg S proposed above and transform 
it into a concave program. This formulation allows us to use the algorithms which have been 
developed to solve a concave program with arbitrary accuracy. We now write F^ s as a program 
with variables as matrix entries and not as d-dimensional vectors. The variables in the new program 
are of the form = (vi,vj). Since all v^s are unit vectors we can write \\vi — Vj\\ as ^J2 — 2 (vi,Vj). 
If we consider the matrix X with ij th entry as we can write the above problem as 

i,jeE 
xa = 1 Vi 

(1 - Xij yi 2 + (1 - Xjk y 1 2 > (1 - Xik y' 2 vij, k 

- x^) > c(l - c)n 2 

i<j 

x y 

where X >z means X is positive semi-definite. 

As we have seen earlier that in order to get an improved approximation factor we must have 
p < 1. Under such a restriction the problem becomes a non-convex feasibility problem as the 
function (1 — Xij) p ^ 2 is not convex. This is a crucial deviation from all the relaxations which 
have been studied till now in the context of approximation algorithms. Because of the non-convex 



nature of the problem we can't use any of the well known techniques like the ellipsoid method 
and the interior point methods and hence can't directly guarantee the polynomial time solvability 
of the program. We therefore transform it into a form which allows us to prove some interesting 
properties. In the above program if we do a change of variable, Zij = (1 — Xij) for all i,j = 1, 2 . . . n, 
the minimization problem looks as the following: 

1 p/2 
mm — t- > z- - 

2p/2 v 

i,jeE 

Z ij + Z jk - Z ik K 



> c(l — c)n 2 

i<j 



za = Mi 

i-zyo 

where 1 is the matrix with all entries as 1. 

Let us call the above program Fg S . This formulation allows us to prove the following lemma: 
Lemma 3. Fg S is a concave program for < p < 2. 

Proof. Since z p l 2 is concave for p < 2 for z > 0, and the sum of concave functions is also concave, 
the objective function is clearly concave. For the constraints defining the feasible set, Y2i<j z ij — 
c(l — c)n? and za = are convex. The constraint 1 — Z y can be shown to be convex as follows: 
Let Z\ and Zi be two matrices corresponding to the variables z^-'s which lie in the feasible set. 
Therefore, they satisfy 1 — Z\ y and 1 — Z2 y 0. Now, consider the line segment for A € [0 — 1] 
XZ\ + (1 — X)Z2 and the matrix 1 — (XZ\ + (1 — A) Z^)- This is positive semidefinite as it can be 
rewritten as A(l — Z\) + (1 — A)(l — Z2) which is a sum of two PSD matrices. 
The only type of constraint left are the triangle inequality constraints. Consider an inequality of this 
type say zfj 2 + z^[ 2 > zf/ 2 . In general, let us look at the region x r +y r > z r for < r < 1. If r = 1/q 
for q > 1 then this region is same as (x 1//<? + y 1 ^ q ) Q > z. Let p\ = (x±,yi, z\) and P2 = (x2,y2, z 2) 
be two points which lie in this region, i.e. [x\^l q + yi 1 ^) 9 > z\ and (x2 1 ^ <? + y2 1 ^ <? ) <? > Z2- To prove 
the convexity of the region we need to show that for any AG [0 — 1], (Xx\ + (1 — X)x2, Xyi + (1 — 
A)y2, Xz\ + (1 — X)z2) also lies inside the region for all such points p\ and p2- Therefore, we have 

( - -\ q 

to show Xzi + (1 — A)^2 < ( (Xxi + (1 — X)x2) q + (Xyi + (1 — X)y2) q ) ■ Thus we will be done if 

we show X(xi 1 /" + yi^if + (1 - X)(x 2 1/q + y 2 1/q ) q < ((Axi + (1 - X)x 2 ) q + (Xyi + (1 - A)y 2 ) «) 

/ 1 1 \ <? 

which is equivalent to proving that the function f(x,y) = [x q + y \ is concave. We will prove 
this by showing that the Hessian of this function is negative-definite for all x, y. We now compute 
the entries of the Hessian matrix. The following calculations are easy to verify, 

dx 

5V 

dy 2 




1 1 

yxi 



In order to show that the Hessian is negative-definite we have to show that for any a, (3 EM, 
the following expression is always non-positive for all x, y > (for x, y as the derivatives do not 
exist): 



a 



dx 2 
g-1 

q 

q-l 

q 

q-l 

q 



dy 2 
a 2 
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.x 1 
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dxdy 
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1 I \ <? 
xi + yi 
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which is non-positive for all a, (3 This proves that the region x p l 2 + y p l 2 > z p l 2 is a convex set for 
all < p < 2. Hence the intersection of all the triangle inequality constraints is also convex. □ 



6 Conclusion 

In this paper, we introduced a new family of mathematical programs inspired from the well-known 
semi-definite program of ARV that promises a O ^log3 +e n^ pseudo-approximation algorithm for 

c-Balanced Separator under the condition that the family of programs can be solved in poly- 
nomial time. Since this family provably gives better approximation guarantees than the celebrated 
linear and semi-definite relaxations of Leighton-Rao [Hj and Arora-Rao-Vazirani [4] respectively, 
investigation both in the direction of polynomial time solvability or hardness will be highly in- 
teresting. The formulation of the proposed family of programs into a well-structured form of 
mathematical programming called concave programming also gives us hope that for many of the 
problems for which optimal approximation factors are not known one can possibly rely upon some 
"nice" programs which are although not convex but can be potential candidates for polynomial 
time solvability. Given that some algorithms for solving concave programs are easy simple to com- 
prehend, it would be also interesting to know whether one can analyze their runs on Fg S and prove 
polynomial time convergence. Another area of investigation could be investigating the links of this 
family of programs with the UGC which has been able to show optimality (close to optimality) of 
various approximation algorithms based on SDP. 
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